Importance of Secondary Structure Elements for Prediction of Go Annotations

نویسندگان

  • Aslı Filiz
  • Eser Aygün
  • Özlem Keskin
  • Zehra Cataltepe
چکیده

Predicted or actual protein secondary structure, in addition to amino acid sequence, is often used for fold recognition and function prediction. Different kinds of secondary structure elements could be predicted with different accuracy by different prediction methods and this could affect the fold or function prediction performance. In this study, contribution of amino acid sequence residues belonging to different types of secondary structure elements (H: alpha helix, E: beta sheet, L: loop) for protein function prediction is investigated. Smith-Waterman alignment similarity scores between amino acid sequences belonging to 6 different sets of secondary structure elements, namely, HEL, HE, HL, H, E and L, are computed. Using these alignment scores, protein function prediction is performed. On a function prediction data set, consisting of 27 Gene Ontology (GO) classes and 4498 sequences, it is found out that using the whole amino acid sequence results in the best performance. Using H and L regions together results almost as well performance as HEL. E regions alone are the least significant in function prediction.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Protein Secondary Structure Prediction: a Literature Review with Focus on Machine Learning Approaches

DNA sequence, containing all genetic traits is not a functional entity. Instead, it transfers to protein sequences by transcription and translation processes. This protein sequence takes on a 3D structure later, which is a functional unit and can manage biological interactions using the information encoded in DNA. Every life process one can figure is undertaken by proteins with specific functio...

متن کامل

Prediction of Secondary Structure of Citrus Viroids Reported from Southern Iran

Abstract Viroids are smallest, single-stranded, circular, highly structured plant pathogenic RNAs that do not code for any protein. Viroids belong to two families, the Avsunviroidae and the Pospiviroidae. Members of the Pospiviroidae family adopt a rod-like secondary structure. In this study the most stable secondary structures of citrus viroid variants that reported from Fars province wer...

متن کامل

The use of gene ontology evidence codes in preventing classifier assessment bias

MOTIVATION The biological community's reliance on computational annotations of protein function makes correct assessment of function prediction methods an issue of great importance. The fact that a large fraction of the annotations in current biological databases are based on computational methods can lead to bias in estimating the accuracy of function prediction methods. This can happen since ...

متن کامل

Seismic Response of Building Structures with Sliding Non-structural Elements

Interaction between a structure under base excitation and heavy non-structural elements that it supports is significant in the seismic analysis and design of the structure. Heavy non-structural elements may slide/rock under base excitation, and this dynamic action affects the seismic behavior of the supporting structure. Hence, in this study, a numerical model was presented to describe the seis...

متن کامل

AUTOMATED FUNCTION PREDICTION Enhanced automated function prediction using distantly related sequences and contextual association by PFP

The impetus for the recent development and emergence of automated function prediction methods is an exponentially growing flood of new experimental data, the interpretation of which is hindered by a shortage of reliable annotations for proteins that lack experimental characterization or significant homologs in current databases. Here we introduce PFP, an automated function prediction server tha...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008